Considering Error Propagation in Stepwise Polynomial Regression

نویسندگان

  • Neima Brauner
  • Mordechai Shacham
چکیده

The selection of an optimal regression model comprising linear combinations of various integer powers of an independent variable (explanatory variables) is considered. The optimal model is defined as the most accurate (minimal variance) stable model, where all parameter estimates of the orthogonalized explanatory variables are significantly different from zero. The potential causes that limit the number of terms that can be included in a stable regression model are investigated using two indicators, which measure signal-to-noise ratios in the variables. The truncation-to-noise ratio indicator is used to measure the extent of collinearity between the explanatory variables and the correlation-to-noise ratio indicator to evaluate the significance of the correlation between an explanatory variable and the dependent variable. It is shown that the number of terms that can be included in a stable polynomial model (and its accuracy) depend on the range and precision of the data, the rate of the error propagation during computations, and the algorithm used to calculate the regression parameters. It is demonstrated that it can often be advantageous to include nonconsecutive powers of the independent variable in an optimal polynomial model. An orthogonalized-variable-based stepwise regression procedure is presented, which enables identifying the optimal model in polynomial regression.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Orthogonal Bases for Polynomial Regres- Sion with Derivative Information in Uncer- Tainty Quantification

We discuss the choice of polynomial basis for approximation of uncertainty propagation through complex simulation models with capability to output derivative information. Our work is part of a larger research effort in uncertainty quantification using sampling methods augmented with derivative information. The approach has new challenges compared with standard polynomial regression. In particul...

متن کامل

APPLICATION OF EVOLUTIONARY POLYNOMIAL REGRESSION IN ULTRAFILTRATION SYSTEMS CONSIDERING THE EFFECT OF DIFFERENT PARAMETERS ON OILY WASTEWATER TREATMENT

In the present work, the effects of operating conditions including pH, transmembrane pressure, oil concentration, and temperature on fouling resistance and the rejection of turbidity for a polymeric membrane in an ultrafiltration system of wastewater treatment were studied. A new modeling technique called evolutionary polynomial regression (EPR) was investigated. EPR is a method based on regres...

متن کامل

A software toolbox for data analysis and regression, considering data precision and numerical error propagation

An algorithm for data analysis and regression by orthogonalized-variable-based stepwise regression (SROV) has been developed and was implemented as a MATLAB toolbox. The program uses QR decomposition based on Gram-Schmidth orthogonalization, which is highly resilient to numerical error propagation, for regression. Variables are selected to enter the regression model according to their level of ...

متن کامل

Considering precision of experimental data in construction of optimal regression models

Construction of optimal (stable and of highest possible accuracy) regression models comprising of linear combination of independent variables and their non-linear functions is considered. It is shown that estimates of the experimental error, which are most often available for engineers and experimental scientists, are useful for identifying the set of variables to be included in an optimal regr...

متن کامل

Identifying and removing sources of imprecision in polynomial regression

Identification and removal of imprecision in polynomial regression, originating from random errors (noise) in the independent variable data is discussed. The truncation error-to-noise ratio (TNR) is used to discriminate between imprecision dominated by collinearity, or numerical error propagation, or inflated variance due to noise in the independent variable. It is shown that after the source o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999